Model | HF Main Model Name | HF Draft Model Name (speculative decoding) | Size | Format | API | GPU | GPU